AITopics | out-of-distribution problem

Bidirectional Learning for Offline Infinite-width Model-based Optimization

Can (Sam) Chen

Neural Information Processing Systems | Feb-11-2026, 16:13:20 GMT

The code is available here.

artificial intelligence, high-scoring design, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.93)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Bidirectional Learning for Offline Infinite-width Model-based Optimization

Neural Information Processing Systems | Dec-25-2025, 03:51:25 GMT

In offline model-based optimization, we strive to maximize a black-box objective function by only leveraging a static dataset of designs and their scores. This problem setting arises in numerous fields including the design of materials, robots, DNAs, proteins, etc. Recent approaches train a deep neural network (DNN) model on the static dataset to act as a proxy function, and then perform gradient ascent on the existing designs to obtain potentially high-scoring designs. This methodology frequently suffers from the out-of-distribution problem where the proxy function often returns adversarial designs.

bidirectional learning, offline infinite-width model-based optimization, static dataset, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.59)

The Out-of-Distribution Problem in Explainability and Search Methods for Feature Importance Explanations

Neural Information Processing Systems | Dec-23-2025, 20:28:06 GMT

Feature importance (FI) estimates are a popular form of explanation, and they are commonly created and evaluated by computing the change in model confidence caused by removing certain input features at test time. For example, in the standard Sufficiency metric, only the top-k most important tokens are kept. In this paper, we study several under-explored dimensions of FI explanations, providing conceptual and empirical improvements for this form of explanation. First, we advance a new argument for why it can be problematic to remove features from an input when creating or evaluating explanations: the fact that these counterfactual inputs are out-of-distribution (OOD) to models implies that the resulting explanations are socially misaligned. The crux of the problem is that the model prior and random weight initialization influence the explanations (and explanation metrics) in unintended ways.
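The Sufficiency computation described in this abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the toy `confidence` function stands in for a trained classifier, `[MASK]` is a hypothetical replacement token, and the sign convention (full-input confidence minus reduced-input confidence) is one common choice.

```python
import math

# Hypothetical per-token weights standing in for a trained classifier.
TOKEN_WEIGHTS = {"great": 2.0, "fun": 1.0, "boring": -2.0}


def confidence(tokens):
    """Toy model confidence: sigmoid of the summed token weights."""
    logit = sum(TOKEN_WEIGHTS.get(t, 0.0) for t in tokens)
    return 1.0 / (1.0 + math.exp(-logit))


def keep_top_k(tokens, importances, k, mask="[MASK]"):
    """Keep the k tokens with the highest importance; mask all others."""
    order = sorted(range(len(tokens)), key=lambda i: importances[i], reverse=True)
    top = set(order[:k])
    return [t if i in top else mask for i, t in enumerate(tokens)]


def sufficiency(tokens, importances, k):
    """Drop in confidence when only the top-k tokens are kept."""
    reduced = keep_top_k(tokens, importances, k)
    return confidence(tokens) - confidence(reduced)


tokens = ["a", "great", "fun", "movie"]
importances = [0.0, 0.9, 0.4, 0.1]  # hypothetical FI estimates
print(keep_top_k(tokens, importances, 1))
print(sufficiency(tokens, importances, 1))
print(sufficiency(tokens, importances, 2))
```

The masked inputs produced by `keep_top_k` are exactly the kind of counterfactual input the abstract flags as out-of-distribution: a real model has typically never seen mostly masked sentences during training, so its confidence on them may not reflect the importance of the kept tokens alone.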

explainability and search method, explanation, out-of-distribution problem, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.56)

bd391cf5bdc4b63674d6da3edc1bde0d-Paper-Conference.pdf

Neural Information Processing Systems | Aug-18-2025, 10:09:44 GMT

artificial intelligence, deep learning, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.93)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Bidirectional Learning for Offline Infinite-width Model-based Optimization

Neural Information Processing Systems | Jan-18-2025, 18:22:06 GMT

In offline model-based optimization, we strive to maximize a black-box objective function by only leveraging a static dataset of designs and their scores. This problem setting arises in numerous fields including the design of materials, robots, DNAs, proteins, etc. Recent approaches train a deep neural network (DNN) model on the static dataset to act as a proxy function, and then perform gradient ascent on the existing designs to obtain potentially high-scoring designs. This methodology frequently suffers from the out-of-distribution problem where the proxy function often returns adversarial designs. The proposed method, bidirectional learning for offline infinite-width model-based optimization (BDI), consists of two mappings: the forward mapping leverages the static dataset to predict the scores of the high-scoring designs, and the backward mapping leverages the high-scoring designs to predict the scores of the static dataset.
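The out-of-distribution failure of proxy-based gradient ascent can be seen in a one-dimensional sketch. This is not the paper's method: a least-squares line replaces the DNN proxy, and `true_score`, the dataset range, the step size, and the step count are all hypothetical choices made so the effect is easy to see.

```python
def true_score(x):
    # Hypothetical black-box objective, peaked at x = 1. The optimizer
    # only ever sees it through the static dataset below.
    return -(x - 1.0) ** 2


def fit_linear_proxy(xs, ys):
    """Ordinary least-squares line; a stand-in for a trained DNN proxy."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def gradient_ascent(x0, slope, lr=0.1, steps=100):
    """Ascend the proxy; the gradient of slope * x + b is just slope."""
    x = x0
    for _ in range(steps):
        x += lr * slope
    return x


# Static dataset: designs cover only [-1.0, 0.5], left of the true optimum.
xs = [-1.0 + 0.1 * i for i in range(16)]
ys = [true_score(x) for x in xs]

slope, intercept = fit_linear_proxy(xs, ys)
x_start = max(xs, key=true_score)       # best design in the dataset
x_opt = gradient_ascent(x_start, slope)  # design after proxy ascent

print("proxy score:", slope * x_opt + intercept)
print("true score: ", true_score(x_opt))
```

In this toy setting the proxy keeps assigning higher scores as the design moves right, so ascent carries the design far outside the range covered by the data, where the true score is much lower than that of the starting design. This is the gap between proxy score and true score that the abstract refers to as adversarial designs.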

high-scoring design, offline infinite-width model-based optimization, static dataset, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.60)

The Out-of-Distribution Problem in Explainability and Search Methods for Feature Importance Explanations

Neural Information Processing Systems | Oct-9-2024, 17:21:06 GMT

Feature importance (FI) estimates are a popular form of explanation, and they are commonly created and evaluated by computing the change in model confidence caused by removing certain input features at test time. For example, in the standard Sufficiency metric, only the top-k most important tokens are kept. In this paper, we study several under-explored dimensions of FI explanations, providing conceptual and empirical improvements for this form of explanation. First, we advance a new argument for why it can be problematic to remove features from an input when creating or evaluating explanations: the fact that these counterfactual inputs are out-of-distribution (OOD) to models implies that the resulting explanations are socially misaligned. The crux of the problem is that the model prior and random weight initialization influence the explanations (and explanation metrics) in unintended ways.

explainability and search method, explanation, feature importance explanation, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.92)

Is Programming by Example solved by LLMs?

Li, Wen-Ding, Ellis, Kevin

arXiv.org Artificial Intelligence | Jun-13-2024

Programming-by-Examples (PBE) aims to generate an algorithm from input-output examples. Such systems are practically and theoretically important: from an end-user perspective, they are deployed to millions of people, and from an AI perspective, PBE corresponds to a very general form of few-shot inductive inference. Given the success of Large Language Models (LLMs) in code-generation tasks, we investigate here the extent to which LLMs can be said to have 'solved' PBE. We experiment on classic domains such as lists and strings, and an uncommon graphics programming domain not well represented in typical pretraining data. We find that pretrained models are not effective at PBE, but that they can be fine-tuned for much higher performance, provided the test problems are in-distribution. We analyze empirically what causes these models to succeed and fail, and take steps toward understanding how to achieve better out-of-distribution generalization. Collectively these results suggest that LLMs make strong progress toward solving the typical suite of PBE tasks, potentially increasing the flexibility and applicability of PBE systems, while also identifying ways in which LLMs still fall short.
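To make the PBE task itself concrete, here is a minimal sketch of synthesizing a string program from input-output examples. It uses plain enumerative search rather than the LLM-based approach the paper studies, and the four-primitive DSL, the `synthesize` helper, and the examples are hypothetical.

```python
from itertools import product

# Hypothetical DSL: a program is a sequence of these string primitives.
PRIMITIVES = {
    "upper": str.upper,
    "lower": str.lower,
    "strip": str.strip,
    "reverse": lambda s: s[::-1],
}


def run(program, s):
    """Apply each primitive in the program to the string, left to right."""
    for name in program:
        s = PRIMITIVES[name](s)
    return s


def synthesize(examples, max_len=3):
    """Return the first (shortest) program consistent with every example."""
    for length in range(1, max_len + 1):
        for program in product(PRIMITIVES, repeat=length):
            if all(run(program, inp) == out for inp, out in examples):
                return program
    return None


examples = [(" abc ", "CBA"), ("xy", "YX")]
program = synthesize(examples)
print(program)
```

A program that fits the given examples is not guaranteed to match the user's intent on new inputs, which is where the in-distribution versus out-of-distribution question raised in the abstract becomes relevant.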

international conference, language model, llm, (13 more...)

arXiv.org Artificial Intelligence

2406.08316

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Alaska (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)
(3 more...)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Gradient-free neural topology optimization

Kus, Gawel, Bessa, Miguel A.

arXiv.org Artificial Intelligence | Mar-7-2024

Gradient-free optimizers allow for tackling problems regardless of the smoothness or differentiability of their objective function, but they require many more iterations to converge when compared to gradient-based algorithms. This has made them unviable for topology optimization due to the high computational cost per iteration and high dimensionality of these problems. We propose a pre-trained neural reparameterization strategy that leads to at least one order of magnitude decrease in iteration count when optimizing the designs in latent space, as opposed to the conventional approach without latent reparameterization. We demonstrate this via extensive computational experiments in- and out-of-distribution with the training data. Although gradient-based topology optimization is still more efficient for differentiable problems, such as compliance optimization of structures, we believe this work will open up a new path for problems where gradient information is not readily available (e.g. fracture).
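The idea of running a gradient-free optimizer over latent variables instead of over the design directly can be sketched as follows. This is only an illustration under loud assumptions: the fixed two-parameter `decode` function stands in for a pre-trained neural decoder, `objective` is a made-up cost, and a simple (1+1) evolution strategy replaces whatever optimizer the authors use. It makes no claim about the iteration savings reported in the paper.

```python
import random


def decode(z):
    # Hypothetical stand-in for a pre-trained decoder: maps 2 latent
    # variables to an 8-element design (an offset plus a linear ramp).
    return [z[0] + z[1] * i / 7.0 for i in range(8)]


def objective(design):
    # Hypothetical cost to minimize: total absolute deviation from a ramp.
    target = [i / 7.0 for i in range(8)]
    return sum(abs(d - t) for d, t in zip(design, target))


def one_plus_one_es(cost, x0, sigma=0.3, iters=500, seed=0):
    """(1+1) evolution strategy: keep a Gaussian mutation if it is no worse."""
    rng = random.Random(seed)
    best, best_cost = list(x0), cost(x0)
    for _ in range(iters):
        candidate = [v + rng.gauss(0.0, sigma) for v in best]
        candidate_cost = cost(candidate)
        if candidate_cost <= best_cost:
            best, best_cost = candidate, candidate_cost
    return best, best_cost


# Search over the 2 latent variables versus the 8 design variables directly.
z_best, latent_cost = one_plus_one_es(lambda z: objective(decode(z)), [0.0, 0.0])
x_best, direct_cost = one_plus_one_es(objective, [0.0] * 8)

print("latent-space cost:", latent_cost)
print("design-space cost:", direct_cost)
```

The optimizer only needs cost evaluations, never gradients, which is why the same loop works for both search spaces; the latent search simply has fewer variables to perturb per iteration.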

design subset, optimization, optimizer, (17 more...)

arXiv.org Artificial Intelligence

2403.04937

Country:

North America > United States > Rhode Island > Providence County > Providence (0.04)
North America > United States > District of Columbia > Washington (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

A 10,000-foot view of AI

#artificialintelligence | Dec-2-2021, 09:00:43 GMT

Yannic gave up on reading every interesting arXiv paper he could find a while ago. Today, he cites a combination of the arXiv, Reddit and Twitter as the main sources of ML-related news that he pulls from to stay up-to-date. It's no secret that media coverage of AI can be overblown (e.g. Sophia, the first android with citizenship, now wants to have a robot baby), one-dimensional and negatively slanted (Falsehoods more likely with large language models). But Yannic highlights a couple of specific tropes and themes that he's come to be particularly skeptical of.

10,000-foot view, ais, algorithm, (16 more...)

#artificialintelligence

Industry: Media > News (0.58)

Technology:

Information Technology > Communications > Social Media (0.81)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.55)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.35)

Extending LIME for Business Process Automation

Upadhyay, Sohini, Isahagian, Vatche, Muthusamy, Vinod, Rizk, Yara

arXiv.org Artificial Intelligence | Aug-9-2021

AI business process applications automate high-stakes business decisions where there is an increasing demand to justify or explain the rationale behind algorithmic decisions. Business process applications have ordering or constraints on tasks and feature values that cause lightweight, model-agnostic, existing explanation methods like LIME to fail. In response, we propose a local explanation framework extending LIME for explaining AI business process applications. Empirical evaluation of our extension underscores the advantage of our approach in the business process setting.

application, explanation, lime, (15 more...)

arXiv.org Artificial Intelligence

2108.04371

Country: